Patent Application 

A METHOD FOR GENERATION AND ASSEMBLY OF WEB PAGE CONTENT 

BACKGROUND OF THE INVENTION 
1. Field of the Invention : 

5 The present invention relates to the generation and assembly of Web page 

content, and more particularly, to the dynamic inline assembly of a Web page from 
textual content. 



2. Discussion of the Prior Art : 

10 The growth of the World Wide Web (Web) has been accelerated by advances in 

computing power and the burgeoning availability of bandwidth. Content providers can 
develop voluminous and complex content which may by transmitted from a host to a 
client in a relatively short time. However, the level of demand for this content often 
outstrips the ability of service providers and hosts to effectively support the content, 
1 5 leading to temporary shortages in Internet service. Within the evolving framework of the 
Web, dynamically assembled Web pages have been developed, in part, to satisfy the 
demand for content while reducing the impact on the service providers and hosts. 

Dynamically assembled Web pages present content in the form of uniquely 
addressable elements. These elements may be obtained from different hosts across the 
20 network in a just-in-time fashion. For content providers, dynamic assembly is particularly 
desirable when Web page content changes frequently (e.g., stock prices) or is targeted to 
an end-user (e.g., personalized greetings). The advantages of dynamic assembly include, 
inter alia, content creation efficiencies and network transmission efficiencies. Content 
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creation efficiencies may be realized when elements are created once and reused. 
Network transmission efficiencies can be realized when elements exhibiting varying 
lifetime attributes are independently stored and cached at locations close to the end-user. 
Web pages may be characterized as static or dynamic. A static page includes 
5 content that is fixed over time, thus, each time the content is called by a browser, the 

same content is displayed. A dynamic page includes content which is generated upon 
being called. Therefore, dynamic pages can provide different content over time. 
Currently, prevalent Web browser implementations constrain the dynamic assembly of 
Web pages since, with the exception of HTML Frames, browsers do not support 
1 0 embedded or inline references to HTML elements. In other words, Web Browsers do not 
support embedded addressable HTML tags within another HTML element. 

HTML Frames provide a means by which Web pages may be assembled from 
uniquely addressable HTML elements. However, the adoption of HTML Frames has 
been limited due to incompatibility in the implementation of Frames support between 
1 5 various Web Browsers and between different releases of the same Web browser. 

Furthermore, Frames need display coordinates or percentages to be pre-allocated. 
Accordingly, if the available display real-estate is not filled it may be wasted. 

At present there are Web sites which dynamically assemble uniquely addressable 
elements to build Web pages for presentation to a user. However, these Web sites 
20 perform page assembly at a centralized server, or on the host side of the network, for 

transfer to the client as an atomic unit. Although Web page assembly is based on element 
usage and reuse, server side assembly does not result in greater content creation 

YOR9-200 1 -028 1 US 1 (8728-5 1 3) 2 



efficiency or greater network efficiency since Web pages are still transmitted as atomic 
units through the network to the end-user's Web browser. 

Therefore, a need exists for a method of dynamic client-side Web page assembly 
using inline references to images including textual content. 



SUMMARY OF THE INVENTION 

According to an embodiment of the present invention, a method is provided for 
processing an HTML document. The method includes obtaining textual content, 
10 converting a portion of the textual content to a media file, storing the media file, and 

providing a tag within the HTML document referencing the stored media file, wherein 
the tag references the media file including the portion of the textual content. The media 
file may be a sound file or an image file. 

An expression of an idea within the textual content is the property of an entity. 
15 The image file can include a watermark. The watermark serves as a notice of the 

existence of a property right. 

The method includes compressing the media file according to a compression 

preference. 

The compression preference identifies a file format. 
20 The media file is stored in one of a client-side database and a networked cache. 

The method further includes generating an image map corresponding to the image 
file, according to a mapping preference. The mapping preference relates selectable spatial 
display coordinates to external document identifiers in order to enable user navigation. 
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Converting the textual content to the media file eliminates the use of scripting 
commands for presenting the textual content. 

According to an embodiment of the present invention, a method is provided for 
converting electronically encoded document content from a text format to an image 
5 format. The method includes obtaining the electronically encoded document, determining 
a content creation preference, and extracting content from the electronically encoded 
document. The method further includes converting the content from the text format to the 
image format according to the content creation preference, and performing one of a 
transfer of the content in image format to a database and a transfer of the content in 
10 image format to a client. 

The method includes applying a watermark to the content in the image format 
according to a watermarking preference. The method includes compressing the 
watermarked content in the image format according to a compression preference. 

According to an embodiment of the present invention, a method for dynamically 
15 converting electronically encoded HTML content from a text format to an image format. 

The method includes receiving a request for the content from a client, obtaining the 
content in text format, determining a content creation preference, and converting the 
content in text format to content in the image format according to the content creation 
preference. 

20 The method further comprises the step of extracting the content in text format 

from the obtained content in the text format. The method includes replying to the request, 
wherein the reply includes the content in the image format. 
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The method applies a watermark to the content in the image format according to a 
watermarking preference. The method includes compressing the watermarked image 
according to a compression preference. 

The method generates an image map according to a mapping preference, wherein 
5 the image map relates selectable spatial display coordinates to external document 
identifiers. 

The method includes storing the content in the image format. The method further 
includes generating an HTML document containing a reference to the stored content in 
the image format for retrieval and inline dynamic assembly by the client. 

I o The method comprises receiving a client system request for verification of the 

watermarked content, verifying the watermarked content, and replying to the request for 
verification. The method includes receiving a client system request for verification of the 
watermarked content, determining whether a plurality of watermarking verification 
elements are present in the watermarked content, and determining whether a plurality of 

1 5 watermarking verification elements are current in the watermarked content. The method 
includes requesting the plurality of watermarking verification elements from a host via a 
secure communications link, receiving the requested watermarking verification elements 
from the host via the secured communications link, and storing the watermarking 
verification elements in storage local to the client system. The method obtains the 

20 watermarked image, verifies the watermark, and presents a verification result to the client 
system. The host is remote to the client. The storage local to the client includes one of a 
hard drive connected to the client and a networked cache. 
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According to an embodiment of the present invention, a program storage device is 
provided readable by machine, tangibly embodying a program of instructions executable 
by the machine to perform method steps for processing an HTML document. The method 
includes obtaining textual content, converting a portion of the textual content to a media 
5 file, storing the media file, and providing a tag within the HTML document referencing 
the stored media file, wherein the tag references the media file including the portion of 
the textual content. The media file is an image file. The image file includes a watermark. 

The method steps further includes the step of generating an image map 
corresponding to the image file, according to a mapping preference. 

10 

BRIEF DESCRIPTION OF THE DRAWINGS 

Preferred embodiments of the present invention will be described below in more 
detail, with reference to the accompanying drawings: 

Fig. 1 is an illustration of a network including a client and Web server according 
15 to an embodiment of the present invention; 

Fig. 2 is a flow diagram of the method of off-loading content to a client according 
to an embodiment of the present invention; and 

Fig. 3 is an example of an image according to an embodiment of the present 
invention. 

20 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

According to an embodiment of the present invention, a Web page can be 
assembled at an end-user's Web browser or client using Hypertext Markup Language 
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(HTML) instructions including embedded tags referencing dynamic assembly images. 
The present invention has several advantages over the prior art. For example, server 
systems are relieved of the content creation costs related to the assembly of the Web page 
on behalf of the client. Further, network resources can be utilized more effectively by 

5 storing elements used to build the Web page closer to the client, for example, elements 

may be cached or stored on the client's local hard drive. In addition, the elements may 
cached throughout a distributed system, thus significantly reducing the volume of 
network traffic associated with the retrieval of the elements. 

It is to be understood that the present invention may be implemented in various 

10 forms of hardware, software, firmware, special purpose processors, or a combination 

thereof. In one embodiment, the present invention may be implemented in software as an 
application program tangibly embodied on a program storage device. The application 
program may be uploaded to, and executed by, a machine comprising any suitable 
architecture. Preferably, the machine is implemented on a computer platform having 

15 hardware such as one or more central processing units (CPU), a random access memory 
(RAM), and input/output (I/O) interface(s). The computer platform also includes an 
operating system and micro instruction code. The various processes and functions 
described herein may either be part of the micro instruction code or part of the 
application program (or a combination thereof) which is executed via the operating 

20 system. In addition, various other peripheral devices may be connected to the computer 
platform such as an additional data storage device and a printing device. 

It is to be further understood that, because some of the constituent system 
elements and method steps depicted in the accompanying figures may be implemented in 
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software, the actual connections between the system elements (or the process steps) may 
differ depending upon the manner in which the present invention is programmed. Given 
the teachings of the present invention provided herein, one of ordinary skill in the related 
art will be able to contemplate these and similar implementations or configurations of the 
5 present invention. 

While current implementations of Web Browsers do not support embedded or 
inline references to HTML elements, they do support inline references to images. For 
example, browsers typically support inline Graphics Interchange Format (GIF) images, 
inline Joint Photographic Experts Group (JPEG) standards images, and Portable Network 
1 0 Graphics (PNG) images. Thus, an image may be dynamically retrieved and presented to 
the user by embedding an image tag, or <IMG> tag, within an HTML document being 
processed by the Web browser or client. The <IMG> tag may be expressed as: <IMG 
SRC="xxxx"> where xxxx represents an image's unique identifier, for example, a 
Uniform Resource Locator (URL) pointing to an image file on a network (e.g., the 
1 5 Internet). Image files may be inserted within an HTML document using <IMG> tags. 

It should be noted that while it is possible to encode an HTML page such that it 
contains <IMG> tags referencing HTML files (e.g., <IMG SRC="xxxx.html">), the 
behavior of current Web Browsers is such that the file will not be processed when called. 
Rather, the browser may present a broken-image icon to the end-user. 
20 In addition to the efficiencies of client side dynamic assembly of web pages, 

imagery is less vulnerable to adoption and/or modification by third parties than textual 
content. Therefore, a distributor can exercise greater control over Web page content by 
converting textual content to an image format prior to distribution. 
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According to an embodiment of the present invention, imagery can be digitally 
signed with a watermark. The addition of a watermark to a digital image servers to notify 
any party viewing or using the content of the existence of copyrights in the content. For 
example, in U.S. Pat. No. 5,530,759 entitled COLOR CORRECT DIGITAL 

5 WATERMARKING OF IMAGES, to G.W. Braudaway et al., a system for placing a 

visible watermark on an image is disclosed to discourage unauthorized use of the image. 
Invisible watermarking techniques may also be used in conjunction with the present 
invention. An invisible watermarking technique is disclosed in U.S. Pat. No. 5,875,249, 
entitled INVISIBLE IMAGE WATERMARK FOR IMAGE VERIFICATION, to F.C. 

10 Mintzer et al. The invisible watermarking system and method verifies image content 
using a stamping process and a verification key. The key is used to extract stamping 
information from the image. The extracted stamping information can be verified 
manually and/or automatically. In U.S. Pat. No. 5,488,664, entitled METHOD AND 
APPARATUS FOR PROTECTING VISUAL INFORMATION WITH PRINTED 

15 CRYPTOGRAPHIC WATERMARKS, to A. Shamir, a method and device is disclosed 

for protecting visual information against unauthorized access and modification using a 
printed cryptographic watermark which is encoded using black and white pixels. These 
patents (5,530,759 5,875,249 and 5,488,664) are examples of watermarking techniques 
including methods of watermark verification and are herein incorporated by reference in 

20 their entirety. The watermark itself can be explicitly displayed and perceptually 
noticeable in the textual imagery. 

Thus, the present invention facilitates copyright protection for textual data, 
whether dynamically assembled or statically prepared, by providing a method for 
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converting textual content to an image format which can then be watermarked. Textual 
imagery incorporating a digital watermark may be used in conjunction with an electronic 
copyright management system. Electronic copyright management systems function as 
clearing houses for digital content and the associated intellectual property rights. Typical 
5 electronic copyright management systems include a database of information relating to 

individual works, for example, the year of publication, the name of the copyright owner, 
etc., and a system of selling rights to those works. According to an embodiment of the 
present invention, by providing notification information in a watermark, the identification 
of copyrighted works and rights holders can be streamlined. 
10 Referring to Fig. 1, a client's web browser 102 is connected to a networked cache 

106 and a Web server 108 through the Internet 104. The Web server 108 includes a 
dynamic application 1 10 for retrieving client requested text from a database 1 12 and 
converting the text to imagery 1 14. The conversion may be according to a preference 
118. The preference 1 18 is a content creation preference including attributes of the text, 
1 5 including font, font size, color, margins, etc. Content creation preferences may be used in 

the absence of other preferences, such as, presentation directives and attributes. 
Presentation directives, including for example, style sheets, may be used to provide 
formatting information specific to a particular HTML document. The Web server may 
watermark the imagery 116 with visible or invisible watermarks according to a 
20 watermarking preference 120. The watermarking preference 120 may specify attributes 
including, transparency of the watermark, the watermark image, etc. The web server 
delivers the requested text in the form of an image to the Web browser 102 through the 
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Internet 104. The Web browser may cache the requested text on a local hard drive or on a 
network cache 106 across the Internet. 

The present invention describes methods for building a Web page according to 
tags embedded within HTML elements. The Web page can be assembled dynamically by 
5 a Web browsers or client supporting HTML links to imagery. The method implements 
embedded references to dynamic assembled images including textual content within 
HTML pages. The method provides a means by which textual content can be converted 
to an image and packaged as a uniquely addressable HTML element. In addition, image 
maps may be used in conjunction with the imagery including textual content, enabling 
10 user navigation within a documents and to other Web pages. Image maps associate 

.1 ' ' ; 

regions of an image (defined by x andj> coordinates) to HREF links, or hyperlinks, which 

may be used to navigate a document and/or the Web. 

Each image may include a digital watermark. To increase the network efficiency, 

the imagery may be compressed. Methods of compressing images are well known in the 
15 art and include, for example, image compression according to the JPEG standard, GIF 

image compression, or PNG compression. 

For image compression, GIF and PNG compression algorithms are lossless, 

however, according to an embodiment of the present invention, lossy compression (e.g., 

JPEG) may be used. Typically, compression of textual content needs to be lossless to 
20 avoid altering the meaning of the text. However, according to the present invention, 

because the textual content is converted into imagery, errors introduced by the 

compression technique may not alter the meaning of the text. Therefore, a more 
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aggressive compression algorithm, one which may introduce errors into the file, may be 
implemented to compress the imagery. 

Referring to Fig. 2, the Web server receives a request of dynamic text 202 and 
retrieves the text 204 from a database or host. The text may be converted to imagery 206 
5 and/or speech 208 (a sound file). The image of the text can be associated with an image 

map 212 for linking to anchors within the image or to other web pages. The image may 
be watermarked 210 and compressed 216. Likewise, methods of watermarking speech 
may also be implemented 214. Various sound formats may be used to compress the 
watermarked speech 218 including, wave files (.wav) and Audio Interchange File Format 

10 (.aif). The compressed imagery and/or sound file can be directed to the client 220 along 

with any corresponding image map. 

According to an embodiment of the present invention, a watermark may be 
verified. A Web browser including a program, such as a plug-in, applet, etc., may 
examine the watermark and provide verification information. Verification information 

15 includes, for example, the presence of a watermark, the date the watermark was posted, 

the integrity of the watermark, and information about the content provider and the 
watermark. 

The present invention contemplates the encryption of content. Static and dynamic 
content may be encrypted for restricted end user use. In addition, particular content 
20 referenced by an HTML document, for example, an individual image file, may be 

encrypted while other content is unencrypted. For example, encrypted content may be 
transmitted via a Secure Socket Layer (SSL) and Secure Hypertext Transfer Protocol 
(HTTPS) while other elements of an HTML document are transmitted via Hypertext 
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Transfer Protocol (HTTP). A tag which references content may indicate whether the 
content is encrypted, for example, that the content referenced is to be transmitted via a 
secure communications means such as HTTPS or SHTTP, a security-enhanced version of 
HTTP. HTTPS and SSL support the use of digital certificates. Therefore, a client may 
5 authenticate the sender or host. Additional methods of encryption are contemplated by 
the present invention, these methods will be obvious to one of ordinary skill in the 
relevant art. 

The mechanics of converting textual content to an image format, for example, the 
JPEG image standard, are provided in Fig. 3 and accompanying Appendixes 1 and 2. 

10 Appendixes 1 and 2 are provided as an example of an embodiment of the present 

invention and given the teachings of the present invention provided herein, one of 
ordinary skill in the related art will be able to contemplate these and similar 
implementations or configurations of the present invention. 
Appendix 1 is the HTML code corresponding to Fig. 3. Appendix 2 includes a Java 

1 5 servlet for converting textual content to imagery. The servlet simulates a database lookup 

based on request parameters and returns a JPEG image created from textual content 
according to a set of parameters. The present invention has been written and tested on 
hardware implementing RedHat Linux 6.2 and IBM's Java 1.3.0, build cxl30-20000815. 
The servlet engine used for the test was a Tomcat 3.1 engine from the Jakarta project at 

20 http://jakarta.apache.org. 

Fig. 3 is an example of a browsers interpretation of an HTML document including 
both text and images including textual content. The images 304-310 created from textual 
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content are in a different font to make it easier to visually differentiate the raw text, e.g., 
302, from the images 304-310. 

According to the present invention, the conversion of textual content to an image 
format may increase the amount of data sent through the network. This expansion may 
5 occur when small amounts of textual content are converted to images. However, this 
expansion is not linear. As more text is converted to imagery, the volume of data 
representing the images may not increase significantly. Further, by converting from 
textual content to an image format, the need for scripting commands, including style 
sheets, may be eliminated. By eliminating scripting commands the volume of imagery 

10 data to be distributed may be reduced as compared to textual data. Appendixes 3 and 4 
show the HTML code for a Web site including the results of single tennis match. 
Appendixes 3 and 4 are provided as an example of an embodiment of the present 
invention and given the teachings of the present invention provided herein, one of 
ordinary skill in the related art will be able to contemplate these and similar 

15 implementations or configurations of the present invention. Appendix 4 shows the 

HTML code for the Web site using <IMG> tags to imagery including the textual content 
of Appendix 3. In addition, Appendix 4 includes an image map linking different areas of 
the image "tennis.jpg" to additional HTML documents within the Web site. 

According to an embodiment of the present invention, the processing load on the 

20 client may be significantly reduced. Currently, Web Browsers need to devote significant 
processing power to the parsing and rendering of HTML text. By presenting textual 
content to the client in the form of an image, the overhead associated with parsing and 
rendering may be reduced or eliminated. 

YOR9-2001-0281US1 (8728-513) 14 



According to an embodiment of the present invention, the attributes of the content 
can be assured regardless of the client's implementation. Currently, different client 
implementations, such as, Microsoft's ® Internet Explorer and Netscape's ® Navigator, 
may interpret markup tags and style sheets in conflicting manners. Different clients may 
5 present the same textual content quite differently. It is a challenge to the developer of 
content to account for the differences in rendering between the variety of 
implementations. According to an embodiment of the present invention, difficulties 
caused by the differences between browsers are avoided by providing content to the 
client in the form of an immutable image which does not require modification or 
1 0 interpretation by a browser. 

Through the implementation of a digital signature, the present invention notifies 
readers of a signed document of the existence of intellectual property rights, including 
but not limited to, those rights due to a work under §§401-414 of Title 17 United States 
Code. The digital signatures may be, among other things, a watermark in the text based 
1 5 imagery. The present invention describes a method by which textual content is 

transformed to an image format suitable for dynamic assembly by a Web browser. The 
present invention also describes methods by which textual data may be copyright 
protected, whether dynamically assembled or statically prepared, by providing a means to 
dynamically convert textual content to an image format including a digital watermarks. 
20 According to yet another embodiment of the present invention, textual content 

may be converted to aural content. Aural content may be provided in conjunction with 
imagery to enhance the depth of content provided, and making Web page content 
accessible to sight impaired users. Methods of converting text to speech are well known 
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in the art. Various methods of sound compression may be used, for example, .wav files 
and MP3's (MPEG-1 Audio Layer-3), to reduce the load on networking facilities. The 
present invention provides for watermarking speech. Speech may be watermarked by 
using, for example, explicit copyright information in the speech or incompressible sound 

5 (e.g., out of the hearing range of the user or background noise) which may be detected 
using a computer program. 

Having described embodiments of a method and system for creation of 
dynamically assembled web pages, it is noted that modifications and variations can be 
made by persons skilled in the art in light of the above teachings. It is therefore to be 

10 understood that changes may be made in the particular embodiments of the invention 

disclosed which are within the scope and spirit of the invention as defined by the 
appended claims. Having thus described the invention with the details and particularity 
required by the patent laws, what is claims and desired protected by Letters Patent is set 
forth in the appended claims. 

15 
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